Cutoff Threshold of Variable Importance in Projection for Variable Selection

نویسندگان

  • Noppamas Akarachantachote
  • N. Akarachantachote
  • S. Chadcham
  • K. Saithanu
چکیده

At present, variable selection turns to prominence since it obviously alleviate a trouble of measuring multiple variables per sample. The partial least squares regression (PLS-R) and the score of Variable Importance in Projection (VIP) are combined together for variable selection. The value of VIP score which is greater than 1 is the typical rule for selecting relevant variables. Due to a constant cutoff threshold is not sometimes suitable for every data structure, a new cutoff threshold for VIP in classification task has been proposed and then compared to the classical one thru the interesting situation simulation. There were 180 situations generated based on four parameters: Percentage of the number of relevant variables, Magnitude of mean difference of relevant variables between two groups, Degree of correlation between relevant variables, and the sample size. The result of this study presents that the new cutoff threshold can improve in identifying relevant variables more than the previous threshold as seeing of good value of the average balanced accuracy in most of situations. AMS Subject Classification: 62H30

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collateral as Signal Variable in a Business Partnership Contract: A Case Study of Refah Kargaran Bank

  There is no convergence in the results of the credit relationship between the financial contracting parties (banks and customers) due to the different importance of the contract variables. Therefore, while examining the impact of the empirical relationship between the variables commercial profit-loss sharing contracts, the dual role of the collateral variable (screening and motivation) is in...

متن کامل

An Overview of the New Feature Selection Methods in Finite Mixture of Regression Models

Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...

متن کامل

Application of a hybrid variable selection method for the classification of rapeseed oils based on H NMR spectral analysis

H nuclear magnetic resonance (NMR) spectroscopy was utilized to distinguish the brands of rapeseed oils. As there are more than four hundreds of NMR variables which can cause the discrimination model redundancy, it is necessary to do effective variable selection. Successive projections algorithm (SPA) executed on the full spectrum only improved a few correct answer rate (CAR) and Cohen’s kappa ...

متن کامل

A Comparison between New Estimation and variable Selectiion method in Regression models by Using Simulation

In this paper some new methods whitch very recently have been introduced for parameter estimation and variable selection in regression models are reviewd. Furthermore , we simulate several models in order to evaluate the performance of these methods under diffrent situation. At last we compare the performance of these methods with that of the regular traditional variable selection methods such ...

متن کامل

Nonparametric variable selection and classification: The CATCH algorithm

In a nonparametric framework, we consider the problem of classifying a categorical response Y whose distribution depends on a vector of predictors X, where the coordinates Xj of X may be continuous, discrete, or categorical. To select the variables to be used for classification, we construct an algorithm which for each variable Xj computes an importance score sj to measure the strength of assoc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014